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Abstract 



We present a linear-time algorithm for deciding first-order logic (FOL) 
properties in classes of graphs with bounded expansion, a notion recently 
introduced by Nesetfil and Ossona de Mendez. Many natural classes of 
graphs have bounded expansion: graphs of bounded tree- width, all proper 
minor-closed classes of graphs, graphs of bounded degree, graphs with no 
subgraph isomorphic to a subdivision of a fixed graph, and graphs that 
(/3 \ can be drawn in a fixed surface in such a way that each edge crosses at 

O ^ ■ most a constant number of other edges. We deduce that there is an almost 

linear-time algorithm for deciding FOL properties in classes of graphs with 
locally bounded expansion. 
^ ' More generally, we design a dynamic data structure for graphs belong- 

\^ ' ing to a fixed class of graphs of bounded expansion. After a linear-time 

CO . initialization the data structure allows us to test an FOL property in 

' constant time, and the data structure can be updated in constant time 

I after addition/deletion of an edge, provided the list of possible edges to 

be added is known in advance and their simultaneous addition results in 
' a graph in the class. All our results also hold for relational structures and 

are based on the seminal result of Nesetfil and Ossona de Mendez on the 
existence of low tree-depth colorings. 



1 Introduction 

A celebrated theorem of Courcelle [T] states that for every integer fc > 1 and 
every property 11 definable in monadic second-order logic (MSOL) there is a 
linear-time algorithm to decide "whether a graph of tree- -width at most k satisfies 
n. While the theorem itself is probably not useful in practice because of the 
large constants involved, it does provide an easily verifiable condition that a 
certain problem is (in theory) efficiently solvable. Courcelle's result led to the 



*A preliminary version of this paper appeared in FOCS 2010. 

t Institute for Theoretical Computer Science (ITI), Faculty of Matlicmatics and Physics, 
Charles University, Prague, Czech Republic. E-mail: {rakdver .kraljOkam.mf f . cuni . cz. In- 
stitute for Theoretical Computer Science (ITI) is supported by Ministry of Education of Czech 
Republic as projects 1M0545. 

t School of Mathematics, Georgia Institute of Technology, Atlanta, GA. Partially supported 
by NSF under No. DMS-0739366. 



1 



development of a whole new area of algorithmics, known as algorithmic meta- 
theorems; see the survey [19] or the forthcoming one [18]. For specific problems 
there is very often a more efficient implementation, for instance following the 
axiomatic approach of [5S] . 

While the class of graphs of tree-width at most k is fairly large, it does 
not include some important graph classes, such as planar graphs or graphs of 
bounded degree. Courcelle's theorem cannot be extended to these classes unless 
P=NP, because testing 3-colorability is NP-hard for planar graphs of maximum 
degree at most four [T^ and yet 3-colorability is expressible in monadic second 
order logic. 

Thus in an attempt at enlarging the class of input graphs, we have to restrict 
the set of properties to be tested. One of the first result in this direction was 
a linear-time algorithm of Eppstein [111 [12] for testing the existence of a fixed 
subgraph in planar graphs. He then extended his algorithm to minor-closed 
classes of graphs with locally bounded tree- width [T3] . Since testing containment 
of a fixed subgraph can be expressed in first order logic (FOL) by a Si-sentence, 
this can be regarded as a precursor to FOL formula testing. Prior to our work, 
the following were the most general results: 

• a linear-time algorithm of Seese [30] to test FOL properties of graphs of 
bounded degree, 

• a linear-time algorithm of Frick and Grohe [15] for deciding FOL properties 
of planar graphs, 

• an almost linear-time algorithm of Frick and Grohe [T5] for deciding FOL 
properties for classes of graphs with locally bounded tree-width, 

• a fixed parameter algorithm of Dawar, Grohe and Kreutzer [2] for deciding 
FOL properties for classes of graphs locally excluding a minor, and 

• a linear-time algorithm of Nesetfil and Ossona de Mendez [5^ for deciding 
Si-properties for classes of graphs with bounded expansion. 

Our main theorem gives a common generalization of these five results. In order 
to state it we need a couple of definitions. For an integer r > 0, a graph G is 
an r-shallow minor of a graph HUG can be obtained from a subgraph of H 
by contracting vertex-disjoint subgraphs of radii at most r (and removing the 
resulting loops and parallel edges). A class Q of graphs has bounded expansion 
if there exists a function f : N ^ such that for every integer r > every 
r-shallow minor of a member of Q has edge-density at most /(r). (The edge- 
density of a graph G is |i?(G)|/|y(G)|.) A preliminary version of our main 
theorem can be stated as follows. 

Theorem 1. Let Q be a class of graphs with bounded expansion, and let H be 
a first-order property of graphs. Then there exists a linear-time algorithm that 
correctly decides whether a graph from Q satisfies 11 . 
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In fact, we prove a more general theorem: we prove there exists a hnear- 
time algorithm for L-structures "guarded" by a member of and we design 
several data structures that allow the L-structure to be modified and support 
FOL property testing in constant time. 

Using known techniques we derive the following corollary from Theorem [TJ 
A class G of graphs has locally bounded expansion if there exists a function 
g :N xN such that for every two integers d,r > 0, for every graph G G Q 

and for every v e V{G), every r-shallow minor of the d-neighborhood of w in G 
has edge-density at most g{d, r). We say that there exists an almost linear-time 
algorithm to solve a problem 11 if for every e > there exists an algorithm to 
solve n with running time ©(n^"*"^), where n is the size of the input instance. 

Corollary 2. Let Q be a class of graphs with locally bounded expansion, and let 
n be a first-order property of graphs. Then there exists an almost linear-time 
algorithm that correctly decides whether a graph from Q satisfies 11 . 

We announced our results in the survey paper [7]. Dawar and Kreutzer [3] 
posted an independent proof of Theorem [T] and a proof of Corollary [2] for the 
more general classes of nowhere-dense graphs (introduced below). However, the 
proofs in [3] are incorrect. A correct proof of Theorem [TJ different from ours, 
appears in [T8] . 

Thus it remains an interesting open problem whether Corollary [5] can be 
generalized to the more general classes of nowhere-dense graphs. This is of sub- 
stantial interest from the point of view of fixed parameter tractability, because 
nowhere density of classes of graphs gives a natural limitation (subject to a 
widely believed complexity-theory assumption). Indeed, we prove the following 
in Theorem [5] below. Let L be a language consisting of one binary relation 
symbol and let Q he & class of graphs closed under taking subgraphs that is not 
nowhere dense. We prove that if testing whether an input graph from Q satisfies 
a given Ei -sentence Lp is fixed parameter tractable when parametrized by the 
size of then FPT=W[1]. 

In the rest of this section we introduce terminology and state all our results. 

1.1 Logic theory definitions 

Our logic terminology is standard, except for the following. All function symbols 
have arity one, and hence all functions are functions of one variable. If L is a 
language, then an L-term is simple if it is a variable or it is of the form f{x) 
where / is a function symbol and a; is a variable. An L-formula is simple if all 
terms appearing in it are simple. The rest of our logic terminology is standard, 
and so readers familiar with it may skip the rest of this subsection. 

A language L consists of a disjoint union of a finite set of relation symbols 
and a finite set of function symbols. Each relation symbol R G U' \s associ- 
ated with an integer a{R) > 0, called the arity of R. In this paper all function 
symbols have arity one. 

If i is a language, then an L-structure A is a triple {V, (R^) Re l^- , (/^) feLf ) 
consisting of a finite set V and for each m-ary relation symbol R G L^ a, set 
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C V", the interpretation of R in A, and for each function symbol / G 
a function : ^ ^ ^ of one variable, the interpretation of / in A. We define 
V{A) := F. For example, graphs may be regarded as L-structures, where L is 
the language consisting of a single binary relation. Wc define the size |^| of A 
to be \V{A)\ + Y^ReL- \^^\ + ^ language L' extends a language L 

if every function symbol of i is a function symbol of L' and the same holds for 
relation symbols. If a language L' extends a language L, A is an L-structure 
and A' is an L'-structure such that V{A) = V{A') and A and A' have the same 
interpretations of symbols of L, then we say that A' is an expansion of A. 

Assume that we have an infinite set of variables. An L-term is defined as 
follows: 

1. each variable is an L-term, and 

2. if / e and t is an L-term, then f{t) is an L-term. 

Each L-term is obtained by a finite number of applications of these two rules. 
We say that an L-term is simple if it is a variable or is of the from f{x) where 
f € and x is a variable. A term t appears in a term t' if either t = t' or 
t' = f{t") for some / G and t appears in t" . 

An atomic L- formula ip is either the symbol T (which represents a tautol- 
ogy); or its negation _L; or R{t\,. . - jtm), where R is an m-ary relation symbol 
of L and ti,. . . ,tm are L-terms; or ti = ^2, where ti and t2 arc L-terms. A 
term t appears in (p if it appears in one of the terms ti, . . . ,tm- An L- formula 
is defined recursively as follows: every atomic L-formula is an L-formula, and if 
^Pl and (p2 are L- formulas and a; is a variable, then -"pi^ ipiW Lp2, 951 A (^2, 3a; ipi 
and Vx </?i are L-formulas. Every L-formula is obtained by a finite application 
of these rules. We write t\ ^ t2 as a shortcut for -^{t\ = t^)- 

A term t appears in an L-formula ipi V ip2 if it appears in cpi or (^2, and we 
define appearance for the other cases analogously. An L-formula is simple if all 
terms appearing in it are simple. A variable x appears freely in an L-formula 97 
if either ip is atomic and x appears in p; or p = ipiW <p2 ot (p = ipi /\ p2 and 
X appears freely in at least one of the formulas ipi and ip2] 01 ip = 3y p' or 
if = \/y p>' , X is distinct from y and x appears freely in ip' . Occurences of x in 
the formula p not inside the scope of a quantifier bounding x, i.e., those that 
witness that x appears freely in p, are called free and the variables that appear 
freely in ip are also referred to as free variables. If <^ is a formula, then the 
notation p{xi, . . . ,x„) indicates that all variables that appear freely in p are 
among xi, . . . , x„. An L-sentence is an L-formula such that no variable appears 
freely in it. If ip{xi, . . . ,x„) is an L-formula and A is an L-structure, then for 
Vi, . . . ,Vn G ^^(^4), WC define A |= ip{vi, . . . , Vn) in the usual way. We denote 
the length of a formula p by \p\. 

1.2 Classes of sparse graphs 

All graphs considered in this paper are simple and finite. The notion of a class of 
graphs of bounded expansion was introduced by Nesetf il and Ossona de Mendez 
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in [30] and in the series of journal papers [211 [221 US]- Examples of classes of 
graphs with bounded expansion include proper minor-closed classes of graphs, 
classes of graphs with bounded maximum degree, classes of graphs excluding 
a subdivision of a fixed graph, classes of graphs that can be embedded on a 
fixed surface with bounded number of crossings per each edge and many others, 
see [2S]. Many structural and algorithmic properties generalize from proper 
minor-closed classes of graphs to classes of graphs with bounded expansion, 
see [Tins]. 

A class G of graphs is nowhere-dense if 

log\E(G)\ , 
lim lim sup ^ ) < 1, 

r-^ook^oo Q \og\V{G)\ 

where the supremum is taken over all graphs G on at least k vertices such that 
G is an r-shallow minor of a member oi Q. It can be shown that every class of 
graphs with (locally) bounded expansion is nowhere-dense [Ml , but the converse 
is false. One can also define a "locally nowhere-dense" class of graphs, but it 
turns out that such classes are nowhere-dense [24j. 

If L is a language, then the Gaifman graph of an L-structure A is the undi- 
rected graph Ga with vertex set V{Ga) = V{A) and an edge between two 
vertices a,b ^ ^(^) (« 7^ b) if and only if there exist R U and a tuple 
(oi, . . . , a-r) S such that a,b E {ai, . . . , a^} or there exists a function f G 
such that b = f{a) or a = f{b). We say that the relational structure A is 
guarded by a graph G with vertex set V{G) — V{A) if Ga C G. Observe that 
if G belongs to a class of graphs with bounded expansion, then every subgraph 
of G has a vertex of bounded degree, and hence the number of complete sub- 
graphs of G is linear in |V^(G')| by a result of [3T]. It follows that the size \A\ of 
any L-structure A guarded by a graph belonging to a fixed class of graphs with 
bounded expansion is 0(|y(yl)|). 



1.3 Our results 

We first state versions of Theorem [H and Corollary |2l for L-structures. 

Theorem 3. Let Q be a class of graphs with bounded expansion, L a language 
and if an L-sentence. There exists a linear-time algorithm that decides whether 
an L-structure guarded by a graph G £ Q satisfies (p. 

Corollary 4. Let G be a class of graphs with locally bounded expansion, L a 
language and Lp an L-sentence. There exists an almost linear-time algorithm 
that decides whether an L-structure guarded by a graph G E Q satisfies Lp. 

Our approach differs from the methods used to prove the results from [21 HH 
[50] mentioned above and is based on a seminal result of Nesetfil and Ossona 
de Mendez [21] on the existence of low tree-depth colorings for graphs with 
bounded expansion, stated below as Theorem [T] 

We also consider dynamic setting and design the following data structures, 
where the first can be viewed as a dynamic version of Theorem [3] In the 
descriptions below we use n to denote the number of vertices of the graph G. 
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• for every class G of graphs with bounded expansion, a language L and an 
L-sentence ip, a data structure that is initialized with a graph G € Q and 
an L-structure A guarded by G in time 0{n) and supports: 

— adding a tuple to a relation of A in time 0(1) provided A stays 
guarded by G, 

— removing a tuple from a relation of A in time 0(1), 

— answering whether A \^ ip in time 0(1), 

• for every class Q of graphs with bounded expansion, an integer do and a 
language L, a data structure that is initialized with a graph G ^ Q and 
an i-structure A guarded by G in time 0{n) and supports: 

— adding a tuple to a relation of A in time 0(1) provided A stays 
guarded by G, 

— removing a tuple from a relation of A in time 0(1), 

— answering whether A \= (p for any Si-L-sentence if with at most do 
variables in time 0(|</?|) and outputting one of the satisfying assign- 
ments, and 

if Q is only a class of nowhere-dense graphs, then for every e > the data 
structure is initialized in time 0{n}~^'^) and supports: 

— adding a tuple to a relation of A in time O(n^) provided A stays 
guarded by G, 

— removing a tuple from a relation of A in time O(n^), and 

— answering whether A \^ (p for any Si-L-sentence p with at most dg 
variables in time 0(|(/3|) and if so, outputting one of the satisfying 
assignments. 

The second of these data structures is needed in our linear-time algorithm for 
3-coloring triangle-free graphs on surfaces [TOj , also see [8] . 

1.4 A hardness result 

Theorem [T] and Corollary [2] fall within the realm of fixed parameter tractability 
(FPT). We say that a decision problem 11 parametrized by a parameter t is 
fixed parameter tractable if there exists an algorithm for 11 with running time 
0{f(t)n''), where n is the size of the input, / is an arbitrary function and c is 
a constant independent of t. Analogously to the polynomial hiearchy starting 
with the classes P and NP, there exists a hiearchy of classess FPT C W[l] C 
W[2] C • ■ • of parametrized problems, where FPT is the class of problems that 
are fixed parameter tractable. See e.g. [S1[T31[27] for more details. 

Theorem 5. Let Q be a class of graphs closed under taking subgraphs. If Q 
is not nowhere- dense and the problem of deciding T^i-properties in Q is fixed 
parameter tractable, then FPT=W[1]. 
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Proof. An r-subdivision of a graph G is the graph obtained from G by subdi- 
viding every edge exactly r times. Since Q is not nowhere-dense, there exists an 
integer r such that Q contains an r-subdivision of every graph [53] . By hypoth- 
esis there exists an FPT algorithm A to decide the existence of an r-subdivision 
of the complete graph Km in an input graph from ^, where the latter problem 
is parametrized by m. This implies that testing the existence of a complete sub- 
graph of order m is fixed parameter tractable for general graphs G, because it is 
equivalent to testing whether the r-subdivision of G has a subgraph isomorphic 
to the r-subdivision of Km, and the latter can be tested using the algorithm 
A. But testing the existence of a Km subgraph is a well-known W[l]-complete 
problem [3], and hence FPT=W[1], as desired. □ 

Dawar and Kreutzer 0\ proved the related result that if Q fails to be nowhere 
dense in an "effective" way and deciding FOL properties in Q is fixed parameter 
tractable, then FPT=AW[*]. 

2 Classes of graphs with bounded expansion 

In this section, we survey results on classes of graphs with bounded expansion 
and classes of nowhere-dense graphs that we need in the paper. We say that 
the expansion of a graph G is bounded by a function g : N ^ K."*" if for every 
integer r > every r-shallow minor of G has edge-density at most g{r). We say 
that a class of graphs Q has expansion bounded by g if every member of Q has 
expansion bounded by g. Thus Q is a class of graphs of bounded expansion if 
and only if its expansion is bounded by some function N — M"*" . 

A rooted forest _F is a directed graph such that every weak component is 
an out-branching (a rooted tree with every edge directed away from the root). 
The depth of F is the number of vertices of a longest directed path in F. The 
closure of F is the undirected graph with vertex-set V{F) and edge-set all pairs 
of vertices joined by a directed path in F. The tree-depth of an (undirected) 
graph G is the smallest integer s such that G is a subgraph of the closure of 
a rooted forest of depth s. For an integer d > 1 a vertex coloring of a graph 
G is a low tree- depth coloring of order d if for every s — 1, 2, . . . , d the union 
of any s color classes induces a subgraph of G of tree-depth at most s. Thus, 
in particular, every low tree-depth coloring of order d of G is a proper coloring 
of G. 

The existence of a low tree-depth coloring with bounded number of colors is 
one of the major structural results for graphs with bounded expansion. Before 
we state this result formally, we have to introduce more definitions. 

Let D he a. directed graph, and and let G' be the graph obtained from the 
underlying undirected graph of D by adding all edges xy such that: 

• there exists a vertex z such that D contains an edge oriented from x to z 
and an edge oriented from z to y (transitivity), or 

• there exists a vertex z such that D contains an edge oriented from x to z 
and an edge oriented from y to 2: (fraternality) . 
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We call G' the augmentation of D. The following is a mild strengthening of a 
result of Nesetfil and Ossona de Mendez [21], but the proof is adequate. 

Theorem 6. There exist functions fi and /2 with the following property. Let D 
he a directed graph with maximum in-degree at most A such that its underlying 
undirected graph has expansion hounded by g, and let G' he the augmentation of 
D. Then the expansion of G' is hounded hy the function g'{r) = /i(g(/2(r)), A). 

Let G be a graph from a class Q with bounded expansion. We denote by 
Vo(G) the maximum edge density among the subgraphs of G. Thus every sub- 
graph of G has a vertex of degree at most 2Vo(G), and hence G has an orienta- 
tion with maximum in-degree at most 2Vo(G). Consider the following sequence 
of graphs: Let Go '■— G and assume that we have constructed Go, Gi, . . . , Gk-i- 
Let D be a directed graph of maximum in-degree at most 2Vo(Gfc_i) + 2 such 
that Gk-i is its underlying undirected graph, and let Gk be the augmentation 
of D. We say that Gk is a k-th augmentation of G. By Theorem [51 the class Gk 
consisting of the graphs Gk obtained as described above starting from graphs 
in G has bounded expansion. It follows that 2Vo(Gfe) < d for some constant d 
depending only on the class Q and fc, and thus Gk is c?-degenerate. In particular, 
the chromatic number of a k-th augmentation of a graph G G is bounded by 
a constant that depends on Q only. 

If G is an orientation of a graph, then any proper coloring of its (Sd'^ -I- l)-th 
augmentation G' of G is a low tree-depth coloring of order d .21[, also see [7] 
for further details. Moreover, the subgraph H of G' induced by the vertices of 
any s < d color classes contains the underlying undirected graph of a rooted 
forest F of depth at most s such that H is a subgraph of the closure of F. 
We refer to this property of the augmentation as depth- certifying and call F a 
depth- certifying forest. The next theorem from [21 follows. 

Theorem 7. Let Q he a class of graphs with bounded expansion and d > 1 
an integer. There exist integers k and K such that any proper coloring of a 
k-th augmentation of a graph G G G is a low tree-depth coloring of order d and 
the k-th augmentation is depth- certifying. In particular, G has a low tree-depth 
coloring of order d with at most K colors. Moreover, such a coloring of G ^ Q 
and the corresponding depth- certifying forests can he found in linear time. 

Similarly, for classes of nowhere dense graphs, we obtain |24) : 

Theorem 8. Let Q he a class of nowhere-dense graphs and d > \ an integer. 
There exists k such that any proper coloring of a k-th augmentation of a graph 
G G G is a low tree-depth coloring of order d and the k-th augmentation is 
depth- certifying. In particular, for every e > the graph G has a low tree- depth 
coloring of order d with at most O(n^) colors. Moreover, such a coloring of 
G ^ G and the corresponding depth- certifying forests can he found in almost 
linear time. 
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3 Deciding FOL properties in linear time 



In this section, we prove Theorem |3l We start with a lemma which allows us to 
remove quantifiers from an FOL formula (Lemma I13p. However, we need more 
definitions. Let i be a language and let X be a set of L-terms. An X -template T 
is a rooted forest with vertex set V{T) equipped with a mapping ut '■ X V{T) 
such that a^^{w) ^ for every vertex w of T with no descendants. If (^9 is a 
quantifier-free L-formula, then a (p-template is an X-template where X is the 
set of all terms appearing in The depth of T is the maximum depth of a 
tree of T. Two X-templates T and T' are isomorphic if there exists a bijection 
/ : V{T) V{T') such that 

• / is an isomorphism of T and T' as rooted forests; in particular, w is a 
root of T if and only if f{w) is a root of T' , and 

• f{aT{t)) — aT'{t) for every L-term t E X. 

The number of non-isomorphic X-templates of a given depth is finite, as 
stated in the next proposition. The proof is straightforward and is left to the 
reader. 

Proposition 9. For every finite set of terms X and every integer d, there exist 
only finitely many non-isomorphic X -templates of depth at most d. 

Let L be a language and let X be a set of L-terms with variables {a:i, . . . , 
An embedding of an X-template T in a rooted forest _F is a mapping : V{T) — >■ 
V{F) such that i/(r) is a root of F for every root r of a tree of T and v is an 
isomorphism of T and the subforest of F induced by i'{V{T)). Let S be an 
L-structure guarded by the closure of F, and wi, . . . , u„ G V{S). We say that 
the embedding v is (wi, . . . ,Vn)- admissible for S if for every term t e X, we 
have v{aT(t)) = t{vi, . . . , where t{vi, . . . , v„) denotes the element of V{S) 
obtained by substituting Vi for Xi in the term t and evaluating the functions 
forming the term t (in particular, if Xi G X, then vlxi) = Vi). We will need the 
following lemma. 

Lemma 10. Let d > 1 be an integer, let F be a rooted forest of depth at most d, 
let L be a language, let f{xi, . . . , Xn) be an L-formula, let S be an L-structure 
guarded by the closure of F, and let vi,...,v„ £ ^{3)- Then there exists a 
if-template T and an embedding of T into F that is (vi, . . . ,Vn) -admissible for 
S. 

Proof. Let X be the set of all L-terms that appear in ip, and let Y be the set 
of all evaluations t{vi, . . . , u„) of all terms t{xi, . . . , Xn) from X. Let T be the 
smallest subforest of F that includes all vertices from Y and the root of every 
component of F that includes an element of Y . For a term t{xi, . . . , Xn) in X 
let arit) := t{vi, . . . , u„), and let be the identity mapping. Then T and v are 
as desired. □ 
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If F is a rooted forest, then the function p : V{F) -> V{F) is the F- 
predecessor function if p(v) is the parent of v unless u is a root of F] if w is a 
root of F, p{v) is set to be equal to v. 

We now show that it can be tested by a quantifier-free formula whether an 
embedding is admissible. 

Lemma 11 (Testing admissibility). Let d > be an integer, L a language in- 
cluding a function symbol p and X a finite set of terms with variables xi, . . ., Xn- 
IfT is an X -template of depth at most d, then there exists a quantifier-free for- 
mula S^t{xi, ■ . . ,Xn) such that for every rooted forest F and every L-structure S 
guarded by the closure of F such that p^ is the F -predecessor function in S, and 
for every n-tuple Ui, . . . , u„ £ V{S), the L-structure S satisfies ^t{vi, ■ ■ ■ , Vn) if 
and only if there exists an embedding of T in the forest F that is (wi, . . . , w„)- 
admissible for S. 

Proof. Let q : V{T) — J> V{T) be the T-predecessor function. Set £,t{xi, . . . ,a;„) 
to be the conjunction of all formulas 

• pHt) ifqHMt)) = q'^'Mt')), and 

. p\t) ^p^'{t') iiqHMt)) + q^'iarit')), 

for all pairs of not necessarily distinct terms t,t' £ X and all pairs of integers k 
and k' , < k,k' < d-\-l (note that including the formulas with t ~ t' allows for 
testing the depth of t m F). Here p'' denotes the function p iterated k times. 

It is straightforward to show that for vi,...,Vn G V{S), an (vi, . . . ,Vn)- 
admissible embedding for S of T in _F exists if and only if S" ^ Ct(wi, . . . , □ 

The following lemma will be extremely useful in the proof of Lemma [T51 
Recall that an L-term is simple if it is a variable or a function image of a 
variable, and an L-formula is simple if all terms appearing in it are simple. 

Lemma 12. Let d > be an integer, L a language, ip(xo, . . . ,Xn) a simple 
quantifier-free L- formula that is a conjunction of atomic formulas and their 
negations, and T a ip-template. There exist a language L that extends L and a 
quantifier-free L-formula Tprp{xi, . . . ,Xn) such that the following holds: 

• L is obtained from L by adding a function symbol p and finitely many 
relation symbols Ui, . . . ,Uk of arity at most one, 

• Tfrp is quantifier-free and the variables xi,...,Xn are the only variables 
that appear inTprp, butTpj^ need not be simple, and 

• for every rooted forest F of depth at most d and every L-structure S 
guarded by the closure of F, there exists an L-structure S such that S 
is an expansion of S and for every ui, . . . , u„ G V{S), 

S 1= ip{vo,Vi, . . . ,Vn) for some Vq G V{S) such that there exists a 
(wo, . . . , Vn) -admissible embedding ofT in F for S if and only if 

S \^Tpj.{vi,...,Vn), 
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where is the F -predecessor function and the relations Uf , . . . , can 
be computed (by listing the singletons they contain) in linear time given F 
and S. 

Proof. Fix a (/s-template T of depth at most d and let q be the T-predecessor 
function. Let X be the set of all terms appearing in (p. Let be the formula 
from Lemma [Tl] applied to the language obtained from L by adding the function 
symbol p. Finally, let K — max(A, c) + 1 where A is the maximum degree of T 
and c is the number of components of T. 

Let t — f{xi) be a term appearing in for some function symbol f € and 
a variable Xi with < i < n. If axit) is neither an ancestor nor a descendant 
of axixi), then for every rooted forest F of depth at most d, every L-structure 
S guarded by the closure of F and every choice of ui, . . . , t;„ S ^(5), there is 
no {vo, ■ ■ ■ , ?;n)-admissible embedding for 5 of T into F, because vt and f^{vi) 
are adjacent in the Gaifmann graph of S; in particular, one is a descendent of 
the other in F. Hence, we can set Tpj. to _L. So, we can assume the following: 

The images under ax of all function images of each variable Xi 
are ancestors or descendants of axixi). 

If arixo) is an ancestor of a vertex arit), say q''{aT{t)) = arixo) for 
k > 0, where t is a term such that xq does not appear in t, then Tpj' will be 
the formula obtained from ip A by replacing each Xq with the term p^{t). 
Clearly, S \= lpj'{vi, . . . , u„) if and only if there is a choice of vq in V{F) such 
that S \= (p(vo, . . . , Vn) and there is a (wq, . . . , i'n)-admissible embedding of T in 
F for S. So, we can assume the following: 

Every term t such that axit) is contained in the subtree of T , , 
rooted at arixo) is xq or a function image oi xq. ^ 

We now define an auxiliary formula ip' to be the formula obtained from ip 
by replacing all subformulas of the form: 

• t — t' , where t and t' are terms such that axit) ^ axit'), 

• t =i t' , where t and t' are terms such that axit) = ax{t'), and 

• . . . , trn) such that ariti), . . . , aT{tm) are not vertices of a clique in 
the closure of T, 

by _L. Such a subformula is not satisfied for any choice of vo for which there 
exists a {vq, . . . , -y„)-admissible embedding of T in F for S. It follows that for 
every vq such that there is a (vq, • ■ ■ , Wn)-admissible embedding of T in for 5, 
S \= (p{vo, Vn) if and only ii S \= (fi'ivQ, . . . , w„). 

Suppose first that the tree of T containing the vertex axixo) also contains 
an QfT-image of another variable. Let v be the nearest ancestor of axixo) in T 
such that there exists a term t„ e X such that xq does not appear in ty and v 
is an ancestor of ar(t„). Note that v ^ arCa^o) by Let (i„ be the depth of 
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V in T, the depth of aT{x{j) and m the number of children of v in T. Let 
ti, . . . , tm-i be terms such that aT{ti), 1 < z < m — 1, are vertices of different 
subtrees rooted at a child of v and not containing aT{xo). Observe that the 
variable Xq does not appear in ii, . . . , by ([1]). 

Let Xq be the subset of X consisting of terms in which xq appears, and let 
To be the Xo-template obtained from T by taking the minimal rooted subtree 
containing aT{X{y) and the root of the tree containing arix^i), and restricting 
the function ar to Xq. Further, let X'^ be the subset of X consisting of Xq 
and the terms t such that arit) lies on the path between the root and aT(a^o)) 
and let Xg be the subset of Xq consisting of the terms mapped by ax to a 
descendant of v. 

We define a unary relation Ui [w) to be the set of elements w of F at depth 
dy + 1 such that the subtree of w in _F contains an element wq at depth dx^ (in 
F) with the following properties: 

• there is a (uo)-a'dmissible embedding of the template To in F for S, and 

• all clauses appearing in the conjunction ip' with terms from Xq only and 
with at least one term from Xq are true with = vq and the terms 
t G Xq \ Xo, say axit) = q''{aT{xo)), replaced with {p^)'^{vo). 

The relation Ui{w) can be computed as follows: for every element vq G V{S) at 
depth dxa of F, evaluate all terms in Xq and test whether the tree To and the 
rooted subtree of F containing the values of the terms are isomorphic as rooted 
trees (this can be done in time linear in the size of Tg which is constant). If 
they are isomorphic, evaluate the clauses in the conjunction (p' with terms from 
Xg only and with at least one term from Xg with the terms in Xg \ Xg replaced 
with {p^)''{vq). If all of them are true, add the predecessor w of i^o at depth 

+ 1 in T to J7i. Since the time spent by checking every vertex vo at depth 
dxo of F is constant, the time needed to compute [/i is linear. 

We further define a unary relation Ui{w), 2<j<m + l, tobe the set of 
elements w of T at depth dy such that Ui{w') is true for at least i — 1 children 
w' of w. Clearly, the relations Ui{w), 2 < i < m + 1, can be computed in linear 
time when the relation Ui has been determined. 

Let if" be the formula obtained from ip' by removing all clauses with terms 
from Xq only that contain at least one term from Xq . Observe that if t is a 
term in ip" such that xq appears in t, i.e., t G Xo \ Xq , then arit) lies on 
the path between v and the root. Let <p'" be the formula obtained from ip" by 
replacing every term t, in which xq appears, with p^{tv), where k is the integer 
such that arit) = q'^iaritv))- Let T' be the (X \ Xo)-template obtained from 
T by taking the minimal rooted subtree containing ariX \ Xg) and restricting 
the function ut to X \ Xg. The formula will then be the conjunction of the 
following formulas: 

• the formula ip'" (x\^ . . . , a;„), 

• the formula ^t' from Lemma [TT] for the (X \ Xo)-template T', and 
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• the formulas 



for all subsets Y of the set {1, ... ,m — 1} where k is the integer such 
that q''{aT{tv)) = v and fcj, i = l,...,m — 1, are the integers such that 

q'-'^iariU)) = V. 

If 5 1= '(fj'{vi, . . . , Vn), then there is a {vi, . . . , t;„)-adniissible embedding of 
T' in F and the vertex v = p'^{tv) has a son w such that J7i(w) is true and 
the subtree of F rooted in w docs not contain the vahic of any term appearing 
in X \ Xq (this is guaranteed by the last type of formulas in the definition 
of ^y). In particular, the subtree rooted in w contains a vertex vq such that 
the (vi, . . . , ?;„)- admissible embedding of T' can be extended to a {vq, . . . , ?;„)- 
admissible embedding of T in F for S and all clauses in the conjuction (f' 
containing terms from Xq are satisfied with xq = vq- Since S \= ip' {vi, . . . ,Vn), 
it follows that S \= </?(wo, ■ ■ ■ ,Vn). The argument that the existence of vq such 
that S \= <f{vo, ■ ■ ■ , Vn) and the existence of a (wq, • • • i i'n)-admissible embedding 
of T in F for S implies that 5 \= Tp j,[vi, . . . , w„) follows the same lines. 

The case that the tree of T that contains the vertex axixQ) does not contain 
aT-image of another variable is handled similarly. In this case, the predicate 
U\ is defined for the roots of the trees of F, and the predicates U2, ■ ■ ■ , Um+i 
(where m is the number of components of T) are nuUary predicates such that 
Ui is true if t/i is satisfied for at least i — 1 roots of the trees in F. □ 

We now prove the lemma that forms the core of our first algorithm. 

Lemma 13 (Quantifier elimination lemma). Let d > be an integer, L a lan- 
guage and ^{xi, . . . , Xn) a simple L-formula of the form 3xo (p'{xo, • ■ • , Xn) such 
that ip'{xo, . . . , Xn) is a quantifier-free L-formula with free variables xo, . . . ,Xn- 
There exist a language L and an L-formula Tp such that the following holds: 

• L is obtained from, L by adding a function symbol p and finitely many 
relation symbols Ui, . . . ,Uk of arity at most one, 

• ip is quantifier-free and the variables the only variables that 
appear in Tp, but Tp need not be simple, and 

• for every rooted forest F of depth at most d and every L-structure S 
guarded by the closure of F, there exists an L-structure S with V{S) = 
V{S) such that S is an expansion of S and for every vi,. . . ,Vn G V{S), 

S \= ip{vi,. ..,Vn) if and only if S \= Tp{vi, . . . , w„) 

where p^ is the F -predecessor function and the relations Uf , . . . , can 
be computed (by listing the singletons they contain) in linear time given F 
and S. 
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Proof. Let d, L and (p' be fixed. We assume without loss of generality that the 
formula ip' is in the disjunctive normal form and all the variables xq, . . . ,Xn 
appear in ip' . Let F be a rooted forest of depth at most d, and let S be an 
L-structure. 

The proof proceeds by induction on the length of ip' . If <p' is a disjunction of 
two or more conjuctions, i.e., cp' — pi V p2, we apply induction to the formulas 
3a;o<^i and 3x0^^2- We obtain languages Li and L2, and for i = 1,2 an Li- 
formula '^p^ and an Li-structure Si. We assume that the new unary relation 

symbols of Li and L2 are distinct and set IT — LIUL2, = l{ = L2 = U{p} 
and Tp ^Tp-^ V^2- We define the L-structure S by V{S) ~ V{S) and by taking 
the interpretations of symbols from 5*1 and S2 . 

Thus in the remainder of the proof we may assume that p' is a conjunction. 
Let vi,V2, ■ . ■ ,Vn e V{S). By Lemma [10] we have S [= p{vi, . . . ,Vn) if and 
only if there exist vq G V{S) and a (^'-template T of depth at most d such 
that S \= p'{vo, ■ . ■ ,Vn) and there exists an embedding of T into F that is 
{vq, . . . , u„)-admissible for S. By Proposition [9] the number of (^'-templates of 
depth at most d is bounded by a function of p and d. By Lemma [T^ for every 
y)'-templatc T of depth at most d, there exist a language Lt, a quantifier- free 
LT'-formula px and an Lj^-structure St that is an expansion of S such that 
for every ui, . . . , u„ g y{S), St \= Pt{vi, . ■ . , w„) if and only if there exists 
such that there is a (ug, ui, . . . , u„)-admissible embedding of T in L" for S and 
S \= p'{vo, ui, . . . , Vn). We may assume that for distinct (^'-templates T and T' , 
if a function or relation symbol belongs both Lt and Lt', then it belongs to L. 
Let L be the language consisting of all function and relation symbols of all Lt, 
let the formula p be obtained as the disjunction of the L-formulas IpT, where 
the disjunction runs over all choices of i^s'-templates T, and let the L-structure 
5* be obtained by taking the union of the interpretations of all St- Then L, Tp 
and S are as desired. □ 

In order to apply Lemma I13[ the formula needs to be simple but the lemma 
produces a formula that need not be simple. The following lemma copes with 
this issue: 

Lemma 14. Let L be a language and p{xi, . . . , x„) an L-formula with q quanti- 
fiers. There exists a language L' that extends L, a simple L' -formula p'{xi, . . . , Xn) 
with q quantifiers and an integer k with the following properties. For every 
L-structure A guarded by a graph G, there exists an L' -structure A' guarded 
by a k-th augmentation G' of G such that A \= p{vi, . . . ,Vn) if and only if 
A' \= p>' {vi, . . . ,Vn) for any Vi,...,Vn € V{A) = V{A'). Moreover, an L' - 
structure A' and a k-th augmentation G' satisfying the above specifications can 
be computed in time 0{\V{G')\ + |L;(G")I)- 

Proof. We may assume that p> is not simple, for otherwise there is nothing to 
prove. Let / and g be function symbols of L such that the L-term g{f{t)) 
appears in p> for some L-term t. Let Li be the extension of L obtained by 
adding a new function symbol h, and for an L-structure A we define an Li- 
structure Ai as the expansion of A, where the interpretation of h is defined by 
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/i^i(ii) — g^{f^{v)) for all v G V{A). Let (pi be obtained from ip by replacing 
the appearance of g{f{t)) by h(t). Then clearly A \= ip{vi, . . . , w„) if and only if 
Ai \= ip'{vi, . . . , Vn) for all til, . . . , ?;„ e ^(^) = l^(^i)- Let D' be an orientation 
of G of maximum in-degree 2Vo(G'), and let D be obtained from D' by adding 
all directed edges with head v and tail f{v) and all directed edges with head 
V and tail g{v). Finally, let Gi be the augmentation of D. Then Gi is a first 
augmentation of G and Ai is guarded by Gi. By repeating this construction 
at most k times, where k depends only on (/?, we arrive at a desired formula if' 
and fc-th augmentation G" of G. It is clear that both can be computed in time 
0{\ViG')\ + \E{G')\). □ 

We are now ready to prove Theorem |31 we prove it in a stronger form needed 
in Section m 

Theorem 15. Let Q be a class of graphs with bounded expansion, L a language 
and (p{xi, . . . , Xn) an L-formula. There exists a language L' , an integer m and a 
linear-time algorithm that given an L-structure A guarded by G ^ Q computes an 
m-th augmentation G' of G, an L' -structure A' guarded by G' , and a quantifier- 
free L' -formula (p'{xi, . . . , Xn) such that for all vi, . . . ,Vn £ V{A) = V{A') 

A 1= ip{vi, . . . ,u„) if and only if A' |= ip'{vi, . . . ,u„). 

In particular, if n — 0, the algorithm decides whether A\= tp. 

Proof of Theorems\^ and\15[ It suffices to show how to compute G' , A' , p' sat- 
isfying the specifications of the theorem, except that Lp' has one fewer quantifier 
than p. A proof of the theorem is then obtained by iterating this algorithm. 

If p> is quantifier-free, then there is nothing to prove. Hence, assume that p) 
contains at least one quantifier. By Lemma[T3]we may assume that p is simple. 

Since Vx ^ is equivalent to -i3a; -1-0, we can assume that p contains a sub- 
formula of the form Bxq^, where i/i is a formula with variables xq, xi, . . . , xat 
and with no quantifiers. Let iVo be the number + 1 increased by the number 
of distinct function images of xq, xi, . . . , x-^ appearing in t/i. Let K = ZN^ -\- 1 
and consider a coloring of a K-\}a. augmentation G' of G; this coloring is a 
depth-certifying low-tree-depth coloring of G of order iVg (see the discussion be- 
fore Theorem [7]). Let Kq be the number of colors used by this coloring and Gi, 
i = 1, . . . , Kq, a unary relation containing vertices with the i-th color. Note that 
Kq is bounded by a constant depending on Q and K only. Clearly, computing 
the augmentation and determining the coloring can be performed in linear time. 

For each function symbol / G L-^ and i = 1, . . . , Kq, let G/ be the predicate 
defined so that G/ (v) is true for v G l^(^) if and only if the color of f(v) is 
i. The colors of the variables and their function images appearing in -0 can be 
described by an iVo-tuple a of numbers between 1 and Kq. Let A be the set of 
all such A^o-tuples. For a G A, let pa be the conjunction of the terms of the 
form Gd- [xi) and G^. [xi) that verifies that the assignment of the colors of the 
values of xq, . . . ,X]^ and their function images are consistent with a. Clearly, 
■^xqiI) is equivalent to the disjunction of the L- formulas ^xq{iI) hp a), where the 
disjunction ranges through all a G A. 
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For each a G A let La be the language with function symbols fa for all 
f £ and relation symbols Ra for all R G L' . For an L-structure A and 
a G A we define an Lc-structure Aa by fa''{^) — /^(^) if both v and f"^{v) 
have a color in a and by (u) = u otherwise; and for each relation symbol 
R& we let let Ra" be defined by restricting R"^ to the vertices with color a. 
Let ipa be the La-formula obtained from 3xo{'4' /\^a) by replacing each function 
symbol / by and each relation symbol R by Ra- For a G A, the formula 
3xo{tp /\ ipa) is true for A if and only if ip'^ is true for Aa- Note that Aq, is 
guarded by the graph Ha obtained from G by removing the edges incident with 
the colors not in a. Since 3Nq + 1 < K, G' contains an out-branching Fa of 
depth at most A'o whose closure contains Ha- 

For each a G A we apply Lemma [13] to d replaced by iVo , the language La , 
formula (pa, La-structure Aa and rooted forest Fa, obtaining a language La, a 
formula ipa and a structure A'^ such that 

h vli^i^- ■ -.vn) if and only if A'a \= ipaivi, - - -,vn) 

for all t;i,...,UAr G V{Aa) = V{A). 

Let L' be the language obtained by taking the union of L and all La over 
a G A, and let cp' be the L'-formula obtained from ip by replacing the subformula 
3xo^p by the disjunction of the La-formulas ipa for all choices of a G A. Let 
A' be the L-structure obtained by taking the interpretations of symbols from A 
and Aa for all a G A. Then A' is guarded by the graph G", and contains one less 
quantifier than ip, as required. Since each application of Lemmas [13] and [14] can 
be performed in linear time and the number of their applications is bounded by 
a function depending on ip and Q only, the reduction of (p to ip' and computing 
G" and A' can be done in linear time. □ 



4 Deciding FOL properties in graphs with lo- 
cally bounded expansion 

The following theorem is implicit in 15 (also see |18)). 

Theorem 16. Let Q be a class of graphs and for an integer d > let Qd he the 
class of graphs consisting of all induced subgraphs of d-neighborhoods of graphs 
in G- Let Qd be nowhere dense for all integers d > 0. Furthermore, let L be a 
language and L' the language obtained from L by adding a new binary relation 
symbol- Suppose that for every d and every L' -formula <p'{x), there exists a 
linear-time algorithm that lists all elements v of an input L' -structure guarded by 
a graph from Qd that satisfy tp'{v). Then, for every L-sentence ip there exists an 
almost linear-time algorithm that decides whether an input L-structure guarded 
by a graph from Q satisfies ip- 

Proof- The proof of [TS] Theorem 1.2] can serve as a proof of this theorem with 
the following modifications. The proof in [15] relies on Lemma 4.4, Corollary 6.3, 
Corollary 8.2 and Lemma 8.3 from the same paper, and those assume that Q has 
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"bounded local tree- width" . However, the use of Lemma 4.4 can be replaced by 
the hypothesis of our theorem, the conclusion of Lemma 8.3 can be deduced from 
the hypothesis of our theorem, and the conclusions of the remaining statements 
follow from the fact that every L-structure A guarded by a member of Qd satisfies 
\A\ = 0{\V{A)\^~^'^) for every e > 0. This, in turn, follows from the fact that 
for every e > 0, every subgraph of every member of Qd bas a vertex of degree at 
most ol\V{H)\'). □ 

Proof of Corollary^ Let L and ip be as in Corollary |4l In particular, the 
class Qd has bounded expansion for every d. By Theorem [151 ioi every integer d 
and every L'-formula ip'{x), there exists a language L" and a quantifier-free L"- 
formula ip"{x) such that every L'-structure A guarded by a graph from Qd can 
be transformed in linear time to an ^''-structure A' with V{A) = V{A') such 
that A \= (p'{v) if and only if A' \= (p"(v) for every v G V{A). In particular, it is 
possible to list in linear time all v G V{A) such that A' ^ <f"{v) since evaluating 
the latter formula requires constant time. So, the assumptions of Theorem [16] 
are satisfied. □ 



5 Dynamic data structures for Si-queries 

In this section, we provide a data structure for answering Si-queries. The 
update time is constant but the price we have to pay is that the graph that 
guards the relational structure must be fixed before the computation starts. 
Before we start our exposition, we need to introduce more definitions. 

Let L be a language with no function symbols. For an integer k, a k-labelled 
L-structure is an L-structure S with a partial injective mapping a : [l,fc] — >■ 
V{S), i.e., a need not be defined for all integers between 1 and k. In our further 
consideration, we will also allow k to be equal to zero. 

The trunk of a A:-labelled L-structure S is the L-structure obtained from 
S by removing all relations with elements only from q;([1,A:]). A /c-labelled L- 
structure S is hollow if S is equal to its trunk. Two A:-labelled L-structures 5*1 
and 5*2 arc k-isomorphic if their trunks are isomorphic through an isomorphism 
commuting with mappings ai and a2- In particular, every fc-labelled L-structure 
is /c-isomorphic to its trunk. 

Suppose now that an L-structure S is guarded by the closure of a rooted 
tree T. The depth of a root of T is zero, and the depth of every other vertex 
of T is the depth of its immediate predecessor plus one. For a vertex v of 
T at depth d, let Pt{v) denote the vertex-set of the path from the root of T 
to V and T{v) the elements the subtree of v (including v itself). Then, S{v) 
denotes the set of all c?-labelled L-structures S' such that S' is an induced 
substructure of S with elements only in Priv) U T{v) and a{i) = w for every 
vertex w G Pt{v) D V{S') — {v} at depth i — 1. If a vertex of Pt{v) at depth 
i — 1 is not contained in S", then a{i) is not defined. 

We are now ready to prove a lemma that contains the core of our data 
structure. 
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Lemma 17. Let L he a language with no Junction symbols, do a fixed integer and 
F a rooted forest of depth at most do. There exists a data structure representing 
an L-structure S guarded by the closure of F such that 

• the data structure is initialized in linear time, 

• the data structure representing an L-structure S can be changed to the one 
representing an L-structure S' by adding or removing a tuple from one of 
the relations in constant time provided that both S and S' are guarded by 

the closure of F , and 

• the data structure decides in time bounded by 0(|<^|) whether a given Si-L- 
sentence (p with at most do variables is satisfied by S, and if so, it outputs 
one of the satisfying assignments. 

Proof. For every vertex u of F at depth d, we will store the following two lists: 

• the Ust of all relations from S that contain v and all their elements are in 
Pf{v), and 

• the list of all (non-d-isomorphic) rf-labcllcd hollow i-structurcs with at 
most do elements that are d-isomorphic to a d-labelled L-structure con- 
tained in S{v). 

Since there are only finitely many non-d-isomorphic d-labelled //-structures with 

at most do elements for every d < do, the length of each list of the second type 
is bounded by a constant depending only on do and L. If is a non-leaf vertex 
of F, there will be a third list associated with v: 

• the list of all (non-isomorphic) {d+ l)-labelled hollow i-structures S' with 
at most do elements that appear in the second list of at least one child of 
v; for each such S', there will be stored the list of all children of v whose 
second list contains S'. 

In addition, there will be a global list of all (non-isomorphic) L-structures with 
at most do elements that appear as induced L-substructures in S. 

Let us describe how all these lists are initialized. The initialization of the 
first type of lists is trivial: just put each relation to the list of its element that is 
farthest from the root. This can clearly be done in constant time per relation. 

Initialization of other types of lists is more difficult. Fix a tree T of F. We 
proceed from the leaves towards the root of T. Let w be a vertex of T at depth 
d. If w is a leaf of T at depth d, then the second list of v contains only those 
hollow d-labellcd L-structurcs S" with V{S') C Pt{v) such that if v e V{S'), 
then S" contains precisely all relations of S induced by V{S') and containing v, 
and if V ^ ^('^')> then S' contains no relations at all. 

Suppose now that v is not a leaf of T. The third list associated with v can be 
initialized by merging the second type of lists of children of v. We describe how 
it can be decided whether a d-labelled hollow L-structure S' should be contained 
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in the list of v of the second type. Assume that S{v) contains a rf-labelled hollow 
L-structure S" that is d- isomorphic to S' . 

Then, V{S") can be decomposed into disjoint subsets Vo,V\, . . . ,Vm such 
that Vo = V{S") n Pt{v), each of the sets Vi, i = 1, . . . ,m, is fully contained 
in a subtree of a child Vi of v, and different subsets Vi, . . . , Vm are contained 
in different substrees. Observe that every relation of S" must be contained in 
Vo U Vi for some i = 1, .... m. Moreover, the only relations of S" contained in 
Vb are those that contain v. 

Hence, the existence of S" can be tested by considering all partitions of V{S') 
into disjoint subsets Vo, Vi, . . . , Vm such that a{[l, d]) C Vb, |Vb \ «([!, < 1 
and every relation of S" is contained in Vb U Vi for some i = 1, . . . ,m, and 
then testing the existence of children vi,. . . ,Vm such that the second list of Vi 
contains a. {d+ l)-labellcd hollow L-structurc {d+ l)-isomorphic to the {d+ 1)- 
labelled hollow L-structure of 5" induced by Vb U V^; if |Vb \ q;([1, = 1, then 
a{d+l) is defined to be equal to the unique element of Vb \ q:([1, rf]) and we also 
test whether the relations of S' containing a(d+l) are precisely those relations 
of S restricted to Pt{v) that contain v, i.e., those in the first list of v. 

We now describe how to test the existence of children v\,. . . ,Vm- Let W 
be the set of children of v such that: if v has at most m children with their 
second list containing a (d + l)-labelled hollow L-structure {d + l)-isomorphic 
to the substructure of S' induced by Vq U Vj, then W contains all such children 
of w. If w has more than m such children, then W contains arbitrary m of these 
children. Clearly, \W\ < rn? < d^. In order to test the existence of such children 
v\,...,Vm oi V, we form an auxiliary bipartite subgraph B: one part of B is 
formed by numbers 1, . . . , m and the other part by children of v contained in 
W. A child w € W is joined to a number i if the second list of w contains a 
{d+ l)-labelled hollow L-structure {d+ l)-isomorphic to the substructure of S' 
induced by Vb U V^ . 

If B has a matching of size m, then this matching determines the choice 
of children Ui, . . . , On the other hand, if such children exist, B contains a 
matching of size to: indeed, if v,i ^ W, then v has at least to children whose 
second lists contains (d+ l)-labelled hollow L-structure {d+ l)-isomorphic to the 
substructure of S' induced by Vb U Vj, and we can change Vi to one of these m 
children that is different from Vii for i' ^ i. Hence, we can assume that Vi gW 
for every z = 1, . . . , to which implies that B has a matching of size to. 

Since the order of B is at most mP + m and the number of disjoint non- 
empty partitions of V{S') to Vb, . . . , Vm is bounded, testing the existence of a 
d-labelled hollow L-structure S" can be performed in constant time for v. 

It remains to construct the global list containing i-structures So with at 
most do elements that appear in S as induced substructures. We proceed simi- 
larly as when determining the lists of inner elements of the forest F. For every 
//-structure S' with at most do elements, we compute the list of trees of F that 
contain S", i.e., S' is contained in the second list of the root of F. Now, 5*0 is an 
induced substructure of S" if and only if there exist element-disjoint L-structures 
, . . . , Sm such that 5o = U • • • U Sm and Si,. . . , Sm appear in to mutually 
distinct trees of F. For each such partition of So into Si,. . . , Sm, we can test 
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whether 5'i , . . . , S,n appear in the Ust of roots of m distinct trees of F using the 
auxihary bipartite graph described earher. Since all structures involved contain 
at most do elements, this phase requires time linear in the number of trees of F. 

We have shown that the data structure can be initialized in linear time. 
Let us now focus on updating the structure and answering queries. Consider a 
tuple {vi, . . . ,Vk) that is added to a relation R or removed from a relation R. 
Let r be the root of a tree in F that contains all the elements vi, . . . ,Vk and 
assume that vi, . . . ,Vk appear in this order on a path from r. By the definition, 
the only lists affected by the change are those associated with vertices on the 
path P{vk)- Recomputing each of these lists requires constant time (we proceed 
in the same way as in the initialization phase except we do not have to run 
through the children of the vertices on the path to determine which of them 
contain particular /c-labelled hollow L-substructure 5" in their lists). Since the 
number of vertices on the path P{vk) is at most do, updating the data structure 
requires constant time only. 

It remains to describe how queries are answered. Let be a Ei-sentence with 
d < do variables. We generate all possible L-structures So with |y(5'o)| = d and 
check whether they satisfy the formula (p. Let SSq be the set of those satisfying 
(f. The set SSo can be generated in time 0(|(/?[) since L and do are fixed. 

Observe that S satisfies (p if and only if it has an induced substructure iso- 
morphic to a structure in SSq. This can be tested in constant time by inspecting 
the global list. Providing the satisfying assignment can be done in constant time 
if during the computation for each substructure a certificate why it was included 
in the list is stored (which requires constant time overhead only). □ 

We are now ready to describe the data structures. We start with the one for 
graphs with bounded expansion. 

Theorem 18. Let L be a language with no function symbols, do a fixed integer 
and Q a class of graphs with hounded expansion. There exists a data structure 
representing a L-structure S such that 

• given a graph G E Q , the data structure is initialized in linear time with 
S being initially empty, 

• the data structure representing an L-structure S can be changed to the one 
representing an L-structure S' by adding or removing a tuple from one of 
the relations in constant time provided that both S and S' are guarded by 
G, and 

• the data structure decides in time bounded by 0{\ip\) whether a given Ei-L- 
sentence ip with at most do variables is satisfied by S , and if so, it outputs 
one of the satisfying assignments. 

Proof. By Theorem [71 there exists a constant K (depending on Q only) such 
that for every G € Q we can find in linear time a low-tree-depth coloring of G 
of order do using at most K colors together with the depth-certifying forests. 
For every do color classes of G, we apply Lemma [TT] Since K is a, constant, the 
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number of data structures we maintain is bounded by a constant which depends 
on Q only. If the given sentence (p is satisfied, it is also satisfied in one of the 
unions of color classes. This is tested using the data structure from Lemma [T71 
If if is not satisfied in any of the auxiliary data structures, it is not satisfied in 
S either and we report that. □ 

The following is a variation of the above theorem for nowhere dense graphs. 

Theorem 19. Let L be a language with no function symbols, fco o, fixed integer, 
£ a positive real number and Q a class of nowhere- dense graphs. There exists a 
data structure representing an L-structure S such that 

• given a graph G ^ Q, the data structure is initialized in time 0{n^^^) with 
S being initially empty, 

• the data structure representing an L-structure S can be changed to the one 
representing an L-structure S' by adding or removing a tuple from one of 
the relations in time 0(n^) provided that both S and S' are guarded by G, 
and 

• the data structure decides in time bounded by 0(|(/j|) whether a given Ei-L- 
sentence ip with at most fcg variables is satisfied by S , and if so, it outputs 
one of the satisfying assignments. 

Proof. Let e > 0. The proof is based on the same data structure as Theorem [TSt 
but uses an extra ingredient. Since Q is nowhere dense, K is no longer a constant; 
instead, we may select K to satisfy K = 0{n'^^''°). In order to be able to answer 
queries in time O(n^) we compute all induced substructures S" of S satisfying 
[^("S")! ^ ^0- This can be done in time 0(7i^). To answer a query whether 
S \= (p we test whether S' \= ip for some induced substructure S' of S with 
[^(•S")! ^ ^0- Since ip is a Ei-sentence, this is a correct test for S \^ p. If 
a change to S is desired, then we make the corresponding change to the data 
structure from Theorem [18] and recompute the set of induced substructures in 
time O(n^). □ 



6 Dynamic data structure for FOL-properties 

In this section, we present our dynamic data structure for testing FOL proper- 
ties. The main result of this section reads as follows: 

Theorem 20. Let Q be a class of graphs with bounded expansion, L a language 
and p an L-sentence. There exists a data structure that is initialized with an 
n-vertex graph G G and an L-structure A guarded by G in time 0{n) and 
supports the following operations: 

• adding a tuple to a relation of A in constant time provided A stays guarded 
by G, 

• removing a tuple from a relation of A in constant time, and 
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• it answers in constant time whether A ip. 

Note that in Theorem we do not allow to change function values of 
functions from L to simplify our exposition; this does not present a loss of 
generality as one can model functions as binary relations. 

Theorem [201 follows from a dynamized version of Theorem [TS] (we state the 
theorem in the variant with no free variables for simplicity). 

Theorem 21. Let Q be a class of graphs with bounded expansion, L a lan- 
guage and if an L-sentence. There exists a language L' and a quantifier-free 
L' -sentence ip' and a data structure representing an L' -structure A' that can be 
initialized with an n-vertex graph G £ Q and an L-structure A guarded by G in 
time 0{n), V{A) = V{A'), and that satisfies: 

• A \= ip if and only if A' \^ (p' , in particular, testing whether A \^ ip can be 
performed in constant time, 

• adding a tuple to a relation of A can be done in constant time provided A 
stays guarded by G, and 

• removing a tuple from a relation of A can be done in constant time. 

In order to prove Theorem 1211 we first establish a dynamized version of 
Lemma [T^ 

Lemma 22. Let d > be an integer, L a language, ip(xo, . . . , Xn) a simple 
quantifier-free L-formula that is a conjucntion of atomic formulas and their 
negations, and T a ip-template. There exists an integer K and an L-formula 
Tprp such that the following holds: 

• L is the language with L — L"^ U {Ui, . . . , Uk} and L^ — L^ \J {p} where 
Ui, . . . ,Uk are new nullary or unary relations, k < K , 

• Tpj. is quantifier- free and the variables xi, . . . ,Xn are the only variables 
that appear freely in Tpj,, but Tpj. need not be simple, and 

• for every rooted forest F with depth at most d and every L-structure S 
guarded by the closure of F , there exists an L-structure S with V{S) = 
V{S) such that for every vi, . . . ,Vn £ V{S) 

S \= >f{vo, Vi, . . . , Vn) for some Vq G V{S) such that there exists a 
(wo, . . . ,Vn) -admissible embedding of T in F for S if and only if 
S Y= Vrivi, . . . ,w„) 

where p^ is the F -predecessor function and the relations Uf , . . . , can 
be computed (by listing the singletons they contain) in linear time given F 
and S. The interpretation of other symbols of L is preserved in S, and 
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• adding or removing a tuple to a relation of S results in adding and remov- 
ing a constant number of singletons from unary relations among Ui , . . . , 
and the changes to all relations Uf , . . . can be computed in constant 
time, provided S stays guarded by the closure of F. 

Proof. We need to describe how the relations C/f , . . . can be updated in 
constant time after adding or removing a tuple to a relation of S. Let us 
consider in more detail the case analyzed in the proof of Lemma [12] and leave to 
the reader the case mentioned at the end of the proof of Lemma [121 Recall (see 
the proof of Lemma [T2l for notation) that Ui{w) is a unary relation containing 
elements w of F at depth d^ + 1 such that the subtree of it; in contains an 
element vq at depth dx„ (in F) with the following properties: 

• there is a (uo)-admissible embedding of the template Tq in F for S, and 

• all clauses appearing in the conjucntion ip' with terms from Xq only and 
with at least one term from Xq are true with xq = vq and the terms 
t E Xq \ Xq, say axit) — q'^{aT{xQ j), replaced with p^'''{va). 

Since none of the functions of 5' changes, the first condition cannot change when 
adding or removing a tuple to a relation of S. The second one can change only 
when a tuple containing a term from Xq with Xq = vq is added or removed 
from a relation. Since all the values of the terms in Xq with xq = vq appear 
only in a subtree of w, only a single element can be added to or removed from 
Ui. Based on the tuple we add or remove, we can identify which vu can be 
added or removed to Ui and using the data structure introduced in the proof 
of Lemma 1171 we can test in constant time the existence of vq satisfying the 
second condition (note that the values of all terms from Xq with xq ~ vq are in 
the subtree of w and the values of the terms in Xq \ Xq are on the path from w 
to the root). 

Once the relation C/i is updated, the relations U2, ■ ■ ■ ,Uk can be updated in 
constant time as well: keep a counter at every vertex at depth d^ determining 
the number of children in [/i. □ 

We are now ready to prove Theorem [5TJ 

Proof of Theorem[M\ Note that when the L-sentence (p is fixed in Theorem [T51 
the language L' and the i'-sentence ip' are also fixed. Hence, the only object 
that changes when the relations of A changes are relations in A'; the functions 
in A' stay the same and thus the m-th augmentation of the graph G from 
Theorem [15] that guards A' is independent of A as long as A stays guarded by 
G. 

We now have to inspect the proofs of Lemma [13] and Theorem [TS] in more 
detail. Since the graph G and all its augmentations appearing in the inductive 
proof of Theorem [15] stay the same, the coloring used in Lemma [T3l also does not 
change when A gets altered. In particular, at each step of the inductive proof 
of Theorem [151 every rooted forest F to which Lemma [13] is applied stays the 
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same. In the proof of Lemma [T31 we replace the use of Lemma [T^ with the use 
of Lemma [22] and observe that every change in S results in a constant number 
of changes in S and these changes can be identified in constant time. Hence, in 
the inductive proof of Theorem 1151 & single change in A results in constantly 
many changes to the structure obtained in the first inductive step, which result 
in constantly many changes to the structure obtained the second inductive step 
(each change in the structure obtained in the first inductive step yields only 
constantly many changes), etc. Since the time to update the final L'-structure 
A' is constant for each of constantly many choices that propagates through the 
induction from a single change of A, the overall update time is constant. □ 
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